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In order to resolve a significant uncertainty in the heat of vaporization 
of silicon - - a fundamental parameter in gas-phase thermochemistry 
AiT2 [Si(<?)] has been determined from a thermochemical cycle involving the 

o _ 

precisely known experimental heats of formation of SiF^g) and F(g) and a 
ON 

CO \ benchmark calculation of the total atomization energy (TAEo) of SiF4 us- 

• i— ( | 

ing coupled-cluster methods. Basis sets up to [8s7p6d4:f2glh] on Si and 

O [7s6p5d4/352/i] on F have been employed, and extrapolations for residual 



X 



basis set incompleteness applied. The contributions of inner-shell correlation 
(—0.08 kcal/mol), scalar relativistic effects (—1.88 kcal/mol), atomic spin- 
orbit splitting (—1.97 kcal/mol), and anharmonicity in the zero-point energy 
(+0.04 kcal/mol) have all been explicitly accounted for. Our benchmark 
TAE =565.89±0.22 kcal/mol leads to Aif£ [Si(g)]=107.15±0.38 kcal/mol 
(Afl£ 2B8 [Si(g)]=108.19±0.38 kcal/mol): between the JANAF/CODATA 
value of 106.5±1.9 kcal/mol and the revised value proposed by Grev and 
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Schaefer [J. Chem. Phys. 1992, 97, 8389], 108.1±0.5 kcal/mol. The revision 
will be relevant for future computational studies on heats of formation of sil- 
icon compounds. Among standard computational thermochemistry methods, 
G2 and G3 theory exhibit large errors, while CBS-Q performs relatively well 
and the very recent Wl theory reproduces the present calibration result to 
0.1 kcal/mol. 

I. INTRODUCTION 

For three of the first-and second-row elements, namely Be, B, and Si, the tabulated heats 
of formation of the atoms in the gas phase carry experimental uncertainties in excess of 1 
kcal/mol. Aside from being propagated into uncertainties for experimental gas-phase ther- 
mochemical data for compounds involving these elements, they adversely affect the accuracy 
of any directly computed heat of formation -- be it ab initio or semiempirical — of any Be, 
B, or Si-containing compounds through the identity 

AH° f)T (X k YiZ m . ..)- kAH° fT (X) - lAH° fT (Y) - mAH° fT (Z) - ... 
= £ T (X fc Y,Z m . . . ) + RT{1 -k-l-m-...)- kE T {X) - IE T (Y) - mE T (Z) -... (1) 

Particularly given the importance of boron and silicon compounds, this is a rather unsatis- 
factory state of affairs. 

Recently we succeeded [ij in reducing the uncertainty for boron by almost an order of 
magnitude (from 3 kcal/mol to 0.4 kcal/mol) by means of a benchmark calculation of the 
total atomization energy (TAEo) of BFa(g). By combining the latter with the experimen- 
tally precisely known [[J heat of formation of BF 3 , we were able to indirectly obtain the 
vaporization enthalphy of boron to high accuracy. It was thus shown that a 1977 experiment 
by Storms and Mueller [0], which was considered an outlier by the leading compilation of 
thermochemical tables ||, was in fact the correct value. 

The heat of formation of Si(g) is given in the JANAF H as well as the CODATA M 



tables as 106.5±1.9 kcal/mol. Desai || reviewed the available data and recommended the 
JANAF/CODATA value, but with a reduced uncertainty of ±1.0 kcal/mol. Recently, Grev 
and Schaefer (GS) found that their ab initio calculated TAE[SiH 4 ], despite basis set 
incompleteness, was actually larger than the value derived from the experimental heats of 
formation of Si(g), H(g), and SiH 4 (g). They suggested that the heat of vaporization of 
silicon be revised upwards to A_£/y [Si(g)]=108.07±0.50 kcal/mol, a suggestion supported 
by Ochterski et al. @. 

The calculations by GS neglected relativistic contributions, which were very recently 
considered by Collins and Grev (CG) ||. Using relativistic (Douglas-Kroll @) coupled- 
cluster methods, these authors found that the TAE of SiH 4 contains a relativistic con- 
tribution of —0.67 kcal/mol. Combined with the earlier calculations of GS, this yields 
Ai/j [Si(g)]=107.4±0.5 kcal/mol, within Desai's reduced error bar. However, as discussed 
there ||, the experimental data for silane, SiH 4 , involve an ambiguity. The JANAF heat 
of formation of silane, 10.5±0.5 kcal/mol is in fact the Gunn and Green |I(J measurement 



of 9.5 kcal/mol increased with a correction ]TT| of +1 kcal/mol for the phase transition 



Si(amorphous)— >Si(cr), which was considered an artifact of the method of preparation by 
Gunn and Green. If one were to accept their argument, the GS and CG calculations on SiH 4 
would actually support the original JANAF/CODATA AH° ffi [Si(g)]. 

No such ambiguities exist for tetrafluorosilane, SiF 4 , for which a very accurate experimen- 
tal heat of formation has been determined |12| by direct combination of the pure elements 
in their respective standard states in a fluorine bomb calorimeter. Johnson's |1^] heat of for- 



mation at 298.15 K, — 386.18±0.11 kcal/mol, is slightly higher in absolute value and slightly 
more precise than the CODATA value of — 386.0±0.2 kcal/mol, itself based on an earlier 
experiment from the same laboratory [13| . 

Clearly, if a benchmark quality (preferably ±0.3 kcal/mol or better) TAE[SiF 4 (g)] could 
be calculated, then an unambiguous redetermination of AHj [Si(g)} would be possible. Our 
previous study on BF 3 being at the limit of the then available computational hardware, 



a similar study on SiF4 - - which contains an additional heavy atom and eight additional 
valence electrons, leading to an expected increase in CPU time and memory requirements by 
a factor of about 3.7 (see below) -- could only be completed most recently, and is reported 
in the present contribution. 

II. METHODS 

Most electronic structure calculations reported here were carried out using MOLPRO 



97.3 |14j] running on SGI Octane and SGI Origin 2000 minisupercomputers at the Weizmann 
Institute of Science. The very largest calculation, a full-valence coupled-cluster calculation 
involving 620 basis functions, was carried out on the National Partnership for Advanced 
Computational Infrastructure CRAY T90 at the San Diego Supercomputer Center. 

As in our previous study on BF 3 , all electron correlation calculations involved in deter- 
mining the valence and inner-shell correlation contributions to TAE were carried out using 



the CCSD (15| and CCSD(T) [0,[n]] coupled-cluster methods. (For the energies of the con- 
stituent atoms, the definition of Ref. |17| for the open-shell CCSD(T) energy was employed.) 
Both the very low 7^ diagnostic |jj| of 0.012, and inspection of the largest coupled-cluster 



amplitudes, suggest a system essentially totally dominated by dynamical correlation. From 



experience it is known JT9[ that CCSD(T) yields results very close to the exact (full config- 
uration interaction) basis set correlation energy under such circumstances. 

Basis set limits for the SCF and valence correlation limits were extrapolated (see below for 
details) from calculated results using the (A)VTZ+2dlf, (A)VQZ+2dlf, and (A)V5Z+2dlf 
basis sets. For silicon, those basis sets consist of the standard Dunning correlation consistent 



20| , |2T[| cc-pVTZ, cc-pVQZ, and cc-pV5Z basis sets augmented with two high-exponent d 



and one high-exponent / functions with exponents obtained by progressively multiplying 
the highest exponent already present by a factor of 2.5. The addition of such 'inner shell 
polarization functions' |22[ has been shown |22"|-f2"5|j to be essential for smooth basis set 
convergence in second-row compounds, particularly those containing highly polar bonds 



such as SiF4 ||26|| . (It should be recalled that inner shell polarization is a pure SCF effect 



and bears little relationship to inner shell correlation. In the present case of SiF 4 , the 
contribution of the inner polarization functions to the SCF/(A)VTZ+2dlf TAE was found 
to be no less than 9.81 kcal/mol.) For fluorine, the basis sets given correspond to Dunning 
(diffuse function)-augmented correlation consistent |27] aug-cc-pVTZ, aug-cc-pVQZ, and 
aug-cc-pV5Z basis sets --it was shown repeatedly (e.g. |28|1 ) that the use of augmented 
basis sets on highly electronegative elements such as F in polar compounds is absolutely 
indispensable for accurate binding energies. The final basis sets for SiF4 involve 235, 396, 
and 620 basis functions, respectively, for (A)VTZ+2dlf, (A)VQZ+2dlf, and (A)V5Z+2dlf. 
The geometry of SiF 4 was optimized by repeated parabolic interpolation at the 
CCSD(T)/cc-pVQZ+l level, where the suffix '+1' stands for the addition of a tight d func- 
tion with an exponent |24| of 2.082 on Si. In previous work on H 2 SiO [25], one of us found 



that this recovers essentially all of the inner polarization effect on the molecular geometry. 
The bond length thus obtained, r e [SiF 4 ]=l. 56043 A, was used throughout this work. (For 
comparison, the experimental r =l. 5598(2) A [J29| ; to our knowledge, no experimentally 
derive r e is available.) 

The inner-shell correlation contribution was determined by comparing the computed 
binding energies correlating all electrons except Si(ls), and correlating only valence electrons, 
using the MTsmall basis set [pj . The latter is a variant of the Martin- Taylor core correlation 



basis set [pl| , p2| in which the very tightest p, d, and / functions were deleted at no significant 
loss in accuracy on the contributions to TAE. 

The scalar relativistic contributions were obtained as expectation values of the first-order 



Darwin and mass-velocity operators @,[34]] at the ACPF (averaged coupled-pair functional 



|35|j) level using the MTsmall basis set. All electrons were correlated in this calculation, 
and it should be noted that the MTsmall basis set is completely uncontracted and therefore 
flexible enough in the s and p functions for this purpose. For the sake of illustration, this 
approach yields —0.67 kcal/mol for SiH 4 , identical to two decimal places with the more 



rigorous relativistic coupled-cluster value 0. 

The contribution of atomic spin-orbit splitting derived from the experimental atomic fine 
structures |36| of Si( 3 P) and F( 2 S*) is —1.968 kcal/mol. For comparison, we also carried 
out all-electron CASSCF/CI spin-orbit calculations |J7[] using the spdf part of a completely 
uncontracted aug-cc-pV5Z basis set, augmented with a single tight p, three tight d, and 
two tight f functions in even-tempered series with ratio 3.0. In this manner, we obtain a 
contribution of —1.940 kcal/mol. In short, to the accuracy relevant for this purpose it is 
immaterial whether the computed or the experimentally derived value is used. 

The zero-point energy was obtained from the experimentally derived harmonic frequen- 
cies and anharmonicity constants of McDowell et al. ||29| . This leads to a value of 8.029 
kcal/mol, whereas one would obtain 8.067 kcal/mol from one-half the sum of the harmonic 
frequencies, J2i diUii/2 and 7.975 from one-half the sum of the fundamentals, J2i diVi/2. The 
approximation Y^idi(uj{ + ^i)/4, at 8.021 kcal/mol, yields essentially the exact result. 

III. RESULTS AND DISCUSSION 

All relevant data are given in Table 1. 

As expected, the SCF contribution of TAE converges quite rapidly. We have shown 
previously |38| that the SCF convergence behavior is best described by a geometric extrap- 



olation A + B/C n of the type first proposed by Feller |p9 |, with extrapolation from the TAE 
contributions to be preferred over extrapolation from the constituent total energies. From 
the (A)VTZ+2dlf, (A)VQZ+2dlf, and (A) V5Z+2dlf results, i.e. Feller (TQ5), we obtain a 
basis set limit of 448.43 kcal/mol, 0.02 kcal/mol more than the SCF/(A)V5Z+2dlf result 
itself. An extrapolation from the (A) VDZ+2d, (A) VTZ+2dlf, and (A) VQZ+2dlf basis sets 
would have yielded 448.47 kcal/mol, an increment of 0.22 kcal/mol over the (A)VQZ+2dlf 
result. 

Given the large number of valence electrons, connected triple excitations account for 
a rather small part of the binding energy: 9.61 kcal/mol at the CCSD(T)/(A)VQZ+2dlf 
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level, compared to a CCSD valence correlation contribution of 114.85 kcal/mol and an SCF 
contribution of 448.25 kcal/mol. Since a CCSD(T)/(A)V5Z+2dlf calculation is beyond the 
limits particularly of memory and available CPU time for this system, this suggests an 
approach in which only the CCSD valence correlation contribution be obtained from the 
largest basis set, while the (T) contribution is obtained from an extrapolation on smaller 
basis sets. Indeed, Martin and de Oliveira (MdO) recently found in a systematic study 
|30| on a wide variety of first-and second-row molecules that this essentially does not affect 
the quality of the results, except when the (T) contribution is a dominant component to 
the binding energy. Helgaker and coworkers [[|(J previously noted the more rapid basis set 
convergence behavior of connected triple excitations as compared with the CCSD correlation 
energy. 

The CCSD/(A)V5Z+2dlf calculation required over 3GB of memory, some 120 GB of disk 
space, and 43 hours of real time (81 hours of CPU time) running on 8 CPUs of the NPACI 
CRAY T90. (Close to 99% parallellism was achieved in the CCSD code simply by adapting 
it to use vendor- supplied parallel BLAS and LAPACK libraries.) To our knowledge, this is 
the largest coupled-cluster calculation ever carried out using a conventional algorithm. 

We have considered two extrapolation formulas based on the asymptotic behavior of 
pair correlation energies [f|l|,£H], namely the 3-point extrapolation A + B/(l + l/2) a due to 
Martin, and the 2-point extrapolation A + B/l 3 formula due to Helgaker and coworkers [f43| . 
(In both formulas, / stands for the maximum angular momentum present in the basis set.) 
MdO found |Kj that both formulas tend to predict the same basis set limit if extrapolated 



from sufficiently large basis sets, but that the limits predicted by the A + B/l 3 formula 
are much more stable with respect to reduction of the sizes of the basis sets used in the 
extrapolation. This is at least in part related to the fact that the three-point extrapolation 
involves, of necessity, one value with an even smaller I than the two-point extrapolation. 

As an illustration, let us consider the BF diatomic which was used to refine the BF3 result 
|TJ. From the three-point A + B/(l + 1/2) Q extrapolation applied to AVnZ (n=3,4,5) valence 



correlation contributions to D e , we obtain 38.35 kcal/mol, compared to 38.76 kcal/mol for 
AVnZ (n=4,5,6). In contrast, a A + B/l 3 extrapolation applied to AVnZ (n=Q,5) yields 
38.78 kcal/mol, just like AVnZ (n=5,6) does; application to AVnZ (n=T,Q) results yields 
an overestimate of 39.08 kcal/mol. 

In the present case, the A+B/l 3 formula predicts a CCSD limit contribution to TAE[SiF 4 ] 
of 119.28 kcal/mol from the (A)VQZ+2dlf and (A)V5Z+2dlf results, with the extrapola- 
tion accounting for 2.27 kcal/mol of the final result. For comparison, extrapolation from 
two smaller basis sets, (A)VTZ+2dlf and (A)VQZ+2dlf, yields 119.62 kcal/mol, while the 
A + B/{1 + l/2) a formula applied to all three values yields a much smaller value of 118.87 
kcal/mol. 

The (T) contribution is computed as 9.11 and 9.61 kcal/mol, respectively, in the 
(A)VTZ+2dlf and (A)VQZ+2dlf basis sets: assuming A + B/l 3 behavior, this extrapo- 
lates to a limit of 9.98 kcal/mol. We thus finally find a basis set limit valence correlation 
contribution of 129.26 kcal/mol. 

As expected, the Si(2s,2p) and F(ls) inner-shell correlation energy is quite substantial 
in absolute terms, accounting for some 28% of the overall correlation energy excluding the 
very deep Si (Is) core. As we have seen in the past for second- row molecules, however, the 
differential contribution to TAE nearly cancels, in this case being only +0.08 kcal/mol. This 
contribution is definitely dwarfed by that of scalar relativistic effects, which as we noted we 
compute to be —1.88 kcal/mol. 

Combining all of the above with the atomic spin-orbit correction noted in the Methods 
section, we finally obtain a "bottom-of-the-well" TAE e of 573.92 kcal/mol; combined with 
the experimentally derived ZPE, we obtain TAEo=565.89 kcal/mol. 

Combining this with the CODATA heats of formation of F(g) and SiF 4 (g), we finally 
obtain AHj [Si(g)] = 107 .34 kcal/mol. Using the more recent A//? [SiF 4 (g)] instead, this 
value is reduced to 107.15 kcal/mol. 

In order to make an assessment of the probable error in these values, we should consider 



both the uncertainty in the calculated TAEo and the propagated experimental uncertain- 
ties in Ai7? [SiF 4 (g)] and Ai?£ [F(g)]. Using exactly the same method as we have em- 
ployed, MdO obtained a mean absolute error of 0.22 kcal/mol for a wide variety of first-and 
second-row molecules, which dropped as low as 0.16 kcal/mol when some molecules with 
significant nondynamical correlation effects were eliminated. Erring on the side of caution, 
we assign 0.22 kcal/mol as a standard deviation rather than an upper limit to the error. 
Given uncertainties of 0.07 and 0.20 kcal/mol in the CODATA heats of formation for F(g) 
and SiF 4 (g), respectively, we obtain 107.34±0.41 kcal/mol for AiJ2 [Si(g)]. Employing the 
more recent Johnson [0 AifSfSiF^g)] instead, which has a smaller uncertainty, we propose 
Aif£ [Si(g)] = 107.15±0.38 kcal/mol as our final estimate. (At 298.15 K, using the CODATA 
H 2 Q8 — Hq functions, this corresponds to 108.19±0.38 kcal/mol.) 

Our final estimate is in fact within the reduced error limits of Desai ||, 
AiJ? [Si(g)] = 106.5±1.0 kcal/mol. It agrees to within combined uncertainties with the GS 
value after applying CG's relativistic correction, 107.4±0.5 kcal/mol, which suggests that the 
'spurious' Si(cr)^Si(amorph) transition enthalpy discussed in the introduction may indeed 
have been a fair estimate. In previous calculations 0,0 on SiF 4 and SiCl 4 , respectively, 
Bauschlicher and coworkers derived values of 107.5±2 and 107.8±2 kcal/mol, respectively, 
in which the error bars are very conservative. In the context of a review article |H| on high- 
accuracy theoretical thermochemistry, Martin recently repeated the GS calculation on SiH 4 
using techniques similar to those employed here, and obtained a TAE [SiH 4 (g)] consistent 
with Ai/? [Si(g)] = 107.55±0.5 kcal/mol if the Si(cr)^Si(amorph) phase transition enthalpy 
was indeed included. We conclude that all data support a slight increase in Ai?£ [Si(g)] to 
the 107.15±0.38 kcal/mol value proposed in the present work. 

As a final note, we consider the performance of some 'standard' theoretical thermochem- 
istry methods for this molecule, compared to our benchmark TAE e =573.92±0.22 kcal/mol. 
As noted previously |[46|| , G2 theory |I7| fails dismally, underestimating TAEo by 8.2 kcal/mol 
even as both spin-orbit splitting and scalar relativistics were neglected, which would together 



have increased the gap by a further 3.85 kcal/mol. G3 theory |48[ represents a substantial 
improvement, being 2.2 kcal/mol below our value including spin-orbit corrections: applying 
the scalar relativistic correction to their value (or, equivalently, deleting it from our own cal- 
culation) would however increase that gap to a still substantial 4.1 kcal/mol. Interestingly, 
both CBS-Q and CBS-QB3 (E| predict much higher values, 576.0 and 577.0 kcal/mol, re- 



spectively. Neither value includes spin-orbit or relativistic corrections: upon applying them, 
we find that they underestimate our best result by only —1.8 and —0.8 kcal/mol, respec- 
tively. Finally, the Wl theory very recently proposed by Martin and de Oliveira |]30| yields 
a value of 573.85 kcal/mol, only 0.07 kcal/mol below the present calibration result. (Wl 
theory includes both scalar relativistic and spin-orbit corrections as standard parts of the 
method.) 

IV. CONCLUSIONS 

From an exhaustive ab initio calibration study on the SiF4 molecule, we obtain a total 
atomization energy at K of 565.89±0.22 kcal/mol. This value includes rather substan- 
tial scalar relativistic (—1.88 kcal/mol) and atomic spin-orbit (—1.97 kcal/mol) effects, as 
well as more minor effects of inner-shell correlation (—0.08 kcal/mol) and anharmonicity in 
the zero-point energy (+0.04 kcal/mol). In combination with experimentally very precisely 
known heats of formation of F(g) and SiF^g), we obtain AiJ? [Si(g)] = 107.15±0.38 kcal/mol 
(Aifj 298 [Si(g)]=108.19±0.38 kcal/mol). This confirms the suggestion of Grev and Schaefer 
that the rather uncertain JANAF/CODATA value of 106.5±1.9 kcal/mol should be re- 
vised upward, albeit to about 1 kcal/mol lower than their suggested 108.1±0.5 kcal/mol. 
The revision will be relevant for future computational studies on heats of formation of sil- 
icon compounds. Among standard computational thermochemistry methods, G2 and G3 
theory exhibit large errors, while CBS-Q performs relatively well and the very recent Wl 
theory reproduces the present calibration result to 0.1 kcal/mol. 
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TABLES 
TABLE I. Computed thermochemical properties for S1F4 and Si in the gas phase. All values 

are in kcal/mol 

Components of TAE 



SCF 



CCSD-SCF 



CCSD(T)-CCSD 



(A)VDZ+2d 

(A)VTZ+2dlf 

(A)VQZ+2dlf 

(A)V5Z+2dlf 

Extrap.{D,T,Q} 

Extrap.{T,Q,5} 



429.45 
446.41 
448.25 
448.41 
448.47 
448.43 



100.39 
108.31 
114.85 
117.01 
119.62 
119.28 



6.03 
9.11 
9.61 

9.98 



Best estimates: 



valence correlation 
inner-shell correlation 
Darwin&mass- velocity 
Atomic fine structure 
best TAE e 
ZPVE 
best TAE 



129.26 
0.08 
-1.88 
-1.97 
573.92 
8.03 
565.89 



Derivation of revised AH°, [Si(g 



AH 



/.2!)8 



#298 — Hq 



AH 



/.() 



Si(cr) § 
Si(s) § 
SiF 4 ( 9 ) § 
SiF 4 (g) H 
F(ff) [| 
Fa(ff) fl 




107.6±1.9 
-386.0±0.2 
-386.18±0.11 
18.97±0.07 




0.769±0.002 

1.8045±0.0002 

3.67±0.01 

1.5578±0.0002 
2.1092±0.0002 




106.5±1.9 
-384.7±0.2 
-384.86±0.13 a 
18.47±0.07 




Si(g) this work 



108.19±0.28 



107.15±0.38 a 



(a) CODATA values 0] for H298 — Hq have been employed 
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